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© Digital high speed algorithmic data recovery method and apparatus. 

© An all digital data algorithmic recovery method and apparatus which operates at jitter greater than 25% and 
where run length is mors than 1000 bits and which uses self calibrated delay elements to phase align a locally 
generated time ruler reference with the data average transition position to reliably establish the sampling time for 
retrieving data from an incoming binary sequence at the center of the data eye. The phase adjusted time ruler 
signal is used to sample transition positions of the data and the sampled data is statistically analyzed in a state 
machine wherein the time ruler is a broadband signal comprising a first and second base frequency and wherein 
the period of one of said frequencies is 



where F n equals the receiver local clock frequency and Ft equals the frequency of the distant clock. 
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This invention relates to digital algorithmic data recovery for serial data transmit/receive systems and 
particularly to all digital method and apparatus for edge position averaging and data/clock synchronization. 

Reference is made to our copending European patent applications being filed concurrently herewith and 
which relate to aspects of our high data rate digital data/clock recovery system. The applications 

s correspond to Our US applications: (1) "Digital Full Range Rotating Phase Shifter", serial number 
08/021,691; filing date Feb. 24 1993; Inventor Bin Guo: AMD Docket Numbar AS94; and (2) "Digital Variable 
In-Loc* Range Phase Comparator", serial number 08/021,712; fiflng date Feb. 24 1993; inventor Bin Guo; 
AMD Docket Number A9U. 

In addition, reference is made to our copending European patent applications 93304078.4. 93304077.6 

to and 93304079.2 which relate to aspects of the data recovery system described In the present application. 

tn serial data transmit/receive systems, the clock information which provides the correct timing for data 
recovery is generally embedded In the data stream and there is no separate One or channel to cany a Clock 
signal to the receiver. Clock frequency recovery and phase alignment with data is generally performed 
before the received data can bB recovered and de-serialized. Traditionally, a Phase Locked Loop (PLL) is 

»5 employed. In which the frequency of a Voltage Controlled Oscillator (VCO) is adjusted responsive to the 
incoming data frequency. The phase acquisition can be accomplished by the same phase locked loop, or 
by a separate phase locked loop depending on whether a single or a multi-loop system is used. The PLL 
detects the phase error of the recovered clock, or the phase difference between the output of the VCO and 
the incoming data and generates an error signal which is filtered and converted to a control voltage to drive 

ao the VCO to reduce the phase difference. When the phase difference of the two Is eliminated or becomes a 
constant the VCO output is truly a retimed clock since it has an estabfehad, known phase in relation to the 
data. In practice, the data is often contaminated with various types of noises such that timing or phase jitter 
is present The edges ie„ the transitions, in the data stream do not always arrive perfectly In time, but 
rather at different "early" or "late" times, causing timing noise known as phase or timing fitter. Incorrect 

zs phase errors are detected and adjustments to the VCO control are still attempted even if the VCO 
frequency is the same or very close to the data frequency. The PLL Is designed to reduce the effect of 
such jitter sources in the high frequency range by employing low pass filter such that the Jitter of the 
recovered clock is limited and good stability of the loop achieved. However, heretofore this has been 
accomplished by introducing other problems. The control voltage to the VCO Is susceptible to internally 

so generated switching noise which is more serious when the operating frequency increases. The low pass 
filter function often employs components such a3 large values capacitors " and resistors. These and other 
factors have made the monolithic integrated circuit implementation costly, especially when Integrated with 
large scale, dense digital functions. Therefore, an all digital solution is highly desirable especially as feature 
dimensions of integrated circuits becomes smaller such that a complicated logic function may use smaller 

35 die area than even a single capacitor component 

In many prior art digital data recovery approaches, an adjustable bias voltage or current is employed to 
adjust the delay value of a delay unit in a ring oscillator to achieve frequency tuning or to adjust phase to 
match that of the incoming data. In these and other prior approaches, the techniques are generally specified 
for a certain data format, coding format jitter totarance or operation frequency, In our aforesaid copending 

ao application 93304078.4 date recovery is based on a discrete deSay time ruler used for bit Interval 
measurement The time ruler In this earlier invention is generated from a local frequency reference source 
and possesses the correct frequency Information for data recovery. However, the time ruler for subsequent 
bit time interval measurements is Initiated upon the arrival of a data transition. Since the data are assumed 
to have a maximum edge-to-edge Jitter of +/-25% of the bit period, the jitter tolerance for such a system is 

45 reduced to the same range, in our aforesaid pending application 93304077.6 jitter contributed from DCD 
(Duty Cycle Distortion) is detected and compensated using an all digital atgoifthmic method. Peak-to-peak 
jitter of an DCD compensated system with this approach may be reduced to less than +/-25% even whan 
the original unadjusted peak-to- peak jitter Is larger than */-25%. However, the DCD compensation 
technique has a minimum improvement on jitter reduction if the DCD jitter is not dominant A limitation of 

so our previous time ruler data recovery scheme is that it is restricted to short run length NRZI date- For tang 
run length NRZI date Stream, (consecutive data bits without transition), the accumulated timing error will 
eventually cause a failure since a finite error of the time ruler (or the delay value of the time ruler delay 
element) is always present in other words, the jitter tolerance of my earlier system diminishes as the data 
run length increases. Since the design of some coding systems limit the run length of the encoded data, i.e. 

ss 4B/5B code used in Fiber Distributed Data Interface (FDDI), and the SBrtOB code used in FC (Fiber 
Channel), our earlier system was useful. However, other coding systems such as the systems using 
scrambled data, may not guarantee the run length limit A need exists to have a general, all digital, time 
ruler based data recovery scheme which has much larger jitter tolerance than + y-25% of the bit period, 
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and which does not deteriorate for longer run length, irrespective of the data or coding format employed. 

In the present specification we describe a general, all digital data recovery method and scheme, which 
retains its benefits without requiring any specific data or coding format and which can be implemented and 
operated for high speed data transfer, 
s We describe a digital scheme for data recovery that detects and locates the average transition position 
of an incoming serial data in respect to a locally generated time ruler reference to establish a correct phase 
relationship between the incoming data and the local time reference. 

Time-calibrated delay elements, or the time ruler based reference, are used instead of the analog, PLL 
based systems for data recovery. 
10 We describe a method to phase align the locally generated time ruler reference with the data average 
transition position such that the time ruler measurement is initiated at the most appropriate position to 
obtain maximum jitter tolerance. 

We describe a combined, so called "integrated time ruler" which is a signal combination of a local 
reference clock signal from a local frequency source and local timB rulers, where the time rulers are 
is generated from time-regulated delay elements, which technique greatly reduces the accumulated error of 
the time ruler measurement such that the run length of the data can be extended to more than 1000 bits. 

We describe a complete data recovery scheme which is suitable for digital integrated circuit im- 
plementation. 

A signal representing a "compensated broadband time ruler", is phase-aligned with the dense ted and 

30 averaged data transition position. The compensated broadband time ruler is used as the recovered timing 
signal for reading and retrieving the data from an incoming serial binary sequence. Delay elements for 
generating the time ruler can be the same as those described in our aforesaid pending application, 
933040795. The incoming data could be preconditioned by passage through a digital jitter correction and 
signal p re-conditioner as described in our aforesaid pending application 93304077.6. The data could be 

25 sampled while traveling through a delay line comprising two time ruler elements. Each time ruler element 
has a nominal delay value equal to one half of the data bit period. The signal representing the integrated 
time ruler is first passed through a digital up/down phase shifter- The phase adjusted time ruler signal is 
then used to sample or register each data edge or transition position into a set of registers, if the signal of 
the integrated time ruler is properly aligned with the data, the timing edge of the time ruler signal should be 

30 aligned with the position known as the center of the "data Bye". The center of the edge distribution of the 
average transition position is one-half bit interval from the eye center. If, however, the two are not aligned 
properly, a different set of registered data would be presented In the register sets. By observing and 
analyzing the sampled data in the register sets, information regarding the phase relation between the 
compensated integrated time ruler signal and the Incoming data can be acquired. An up/down signal is then 

as generated in a Phase Adjust Decision block to inform a rotating phase shifter to make adjustment such that 
the phase difference is reduced. This process continues until the phase relation of the two reaches a pre- 
specifted condition. 

By combining the reference frequency and a time ruler signal which is real time calibrated to be a 
multiple of the reference frequency to form the compensated broadband time ruler we achieve the ability to 

40 remain properly frequency locked even for large extended run length on the incoming data. The signal 
representing the compensated broadband time ruler is not a regular clock signal since it is not a single 
frequency periodic signal, but rather a signal with two base frequencies: the reference clock frequency, and 
the frequency derived from the reference frequency using the delay time ruler technique. To be able to 
correct for a extended run length, the frequency, or the bit period accuracy, becomes crucial. In the time 

45 ruler data recovery method described in our aforesaid co-pending application 33304078.4 the time lapse of 
the "period" of the time ruler is determined by the regulated delay elements. There is always a finite time 
error m such delay elements, depending also on tile achievable delay adjustment resolution. For the same 
achievable adjustment resolution, the percentile error becomes larger as data rate increases and bit period 
decreases. The accumulated error can become very significant if time rulers with the same time error are 

60 simply "cascaded". The absolute as well as the relative error grows as the run length grows. One corrective 
method for such error would be to "compensate" the error periodically, before it becomes too big. This, of 
course, requires another time ruler or time reference with much higher timing accuracy. In fact, a local 
frequency reference source, which may run only at one Nth of the data frequency, can have much higher 
accuracy. Many systems including the FDD1 standards specify tnat the local frequency source at the 

ss receiver end is allowed to have maximum frequency offset + /-50 ppm (parts per million) with respect to 
the frequency source on the transmitter end (or the incoming data frequency). For +/-50 ppm accuracy 
(which is not difficult to achieve with modem crystal-oscillator technology), tess than %timing error 
would occur for the accumulation of 200 cycles of (he reference dock, or 200 x N bit periods. The 
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accumulated time error from a cascaded time ruler for so many cycles would be much larger. Our solution 
is to use the local reference clock signal (known as the parallel data clock or byte dock) to correct the 
delay generated time ruler. Assume that the reference dock signal is 1/N times the data rate, so that N time 
ruler periods at the incoming data frequency are generated within a reference clock (byte clock) period, and 
s assume that each time ruler period has a timing error of +Te. then at the end of the Nth time ruler period, 
a total time error of +(N x Te) is accumulated. However, if the last time ruler period in each byte is 
modified by the amount of -<N xTe), the time error Is fully compensated. By using our "Compensated 
broadband time ruler" concept the data run length is allowed to exceed thousands of bits without causing 
an error In recovered data, 
to In the accompanying drawings, by way of example only: 

Fig. 1a is a functional diagram of a transceiver providing an all digital data recovery system employing a 
locally generated "Compensated broadband time ruler" and algorithmic data transition averaging- 
Fig. 1b is the block diagram of an alternative embodiment of the system of Fig. la which employs a 
preconditioning block. 

75 Fig. 2a is the block diagram of the Compensated Broadband Time ruler Generator and Regulator. 
Fig. 2b is the timing diagram for Fig. 1a and Rg. 1b. 
Fig. 2c Is the block diagram of a DCD Preconditjoner. 

Fig. 3 is the block and logic diagram of one embodiment of the edge distribution sampler block of 
Figures la and 1b. 

20 Fig. 4a and 4b show timing diagrams and analysis used in the phase adjustment decision function. 
Fig. 5a, 5b, 5c through 8a, 8b and Be are illustrative diagrams analyzing the l-O test method. 
Fig. 9a through 9e are illustrative summary diagrams analyzing the distribution test method for different 
data transition distributions. 

Rg. 10 is a chart showing the truth table combinations for generating the up/down commands in the 
28 phase adjust decision block. 

Rg. 1 1 is the block diagram of an embodiment of an alternative edge distribution sampler. 

Rg. 12 is the schematic for the Bit Serializer of Rg. 2a. 

Fig. 13a 1$ a block diagram for a 1/2 Tb delay element of Rg. 2a. 

Rg. 13b i3 a schematic for the delay cell of the 1/2 Tb delay, 
so Fig. 14a is a block diagram of the Sampling Register of Rg. 2a. 

Rg. 14b is a block diagram of the Ruler Adjustment Decision Circuit of Rg. 2a. 

Rg. 14c rs a timing diagram of the circuit of Rg. 14a. 

Rg. I4d is the logic truth table analysis of the Status Decode Logic of Rg- 14b. 

Rg. iSa is the time zoning analysis of the Adjust Update Control of Rg. 14b. 
as Rg. 1 5b is the logic truth table for Adjust Update Control of Rg. 1 4b. 

Rg. 16 is a block diagram of the Digitally Adjustable Delay. 

Rg. 17 Is the timing diagram for Rg. 16. 

Rg. 18 Is the block diagram for the 360 Degree Phase Shift Range Calibrator. 
Rg. 19 is the block diagram of one embodiment of the Rotating Phase Shift Control. 
40 Rg. 20 is the logic diagram for the register stages of Rg. 19. 

Rg. 21 is the table and chart of equations and the truth table for Rg. 20. 

DETAILED DESCRIPTION 

45 Rg. 1a shows tfm overall block diagram of a data transceiver using locally generated compensated 
broadband time rulers and data transition averaging. 

With reference to Rg. la, to transmit data, the transceiver i receives N-toit parallel data tdaTA on bus 
11, serializes it and transmits out the serialized data SERO on line 12- The transceiver receives the serial 
incoming data SERIN on line 13, recovers the data from the output of D flip-flop 15 using a recovered and 

so retimed signal SBITCK 18' representing a phase-shifted copy of the compensated broadband time ruler on 
fine 16. The deserialized N-brt parallel data as the recovered data RDATA are sent out on line 14. A 
reference frequency source generated clock signal LCLK as the local clock running at 1/N of the data 
frequency is taken in on line 31 . The accuracy of the LCLK or its frequency is specified in respect to the 
clock frequency of the remote transmitter/receiver. A "compensated broadband* time ruler signal BITCK 16. 

ss equivalent to a bit-rate clock, is generated in the compensated broadband Time Ruler Generator and 
Regulator block 30, and output on line 16. This signal is then sent to digital adjustable delay 50, Rg. 16 of 
the Digital Phase Shifter 56 in which phase shifting is performed, and from which the phase adjusted 
integrated time ruler signal SBITCK is output on line 16*. The BITCK signal is generated from a local 

4 
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frequency reference source LCLK which has no certain phase relation with the incoming serial data SERIN 
on fine 13 and thus can not be directly used to read or sample the data The serial incoming data SERIN on 
line 13 may be preconditioned. Fig. 1b, to compensate duty cycle distortion by signal pre-concfitioner 20'. 
so that the preconditioned incoming data is available on line 13'. Alternatively, the serial incoming data 

5 SERIN can be directly applied to the D input of D flip-flop 15 for data recovery, via connector 13" since thi3 
scheme has large jitter tolerance and data can be reliably recovered even in the presence of the duty cycle 
distortion fitter, in the latter case, only the two 1/2 Tb time ruler delay elements 22 and 23 in Ruler Delay 20 
are needed. The serial incoming data on line 13" is sent to the cascaded time ruler delay Z2 and 23. each 
of which delays for a time lapse equal to half of the bit period. The edge distribution sampler 81 samples 

to the level status of nodes 25 through 29 to capture the location of data transitions while the data signal wave 
is traveling in time ruler delays. Different distribution patterns on nodes 25-29 will be observed if the 
sampGng takes place at different phase instances. In this arrangement, thB phase adjusted signal SBITCK 
16* is used to strobe the sampler for capturing the status of the transitions. Since SBITCK IB' is also used 
as the clocking signal for D flip-flop 15 for reading the data on Una 13", the most appropriate phase relation 

fs of SBITCK and the incoming data SERIN is that the SBITCK sampling edge is aligned with the center of the 
data eye. tf no time jitter is present in the incoming data, an data transition should be captured at or In the 
neighborhood of node 27. In other words, if the clocking edge SBITCK is one half bit away from the SERIN 
data transrflon in time, It Is called "phase in lock". In practice, there are always timing jitters associated with 
data edges. Distributed edges should be observed if a certain number of edges are captured. The sampled 

so edge distribution information is sent to the phase adjust decision block 82 via bus 85. The captured 
distribution data are analyzed and decisions are made in block 82 to inform the rotating phase shift control 
56 of the direction of the phase error. If the retimed or phase adjusted clock signal SBITCK is not aligned 
with tha center of the data eye, commands are sent to the rotating phase shift control of the phase shifter 
56, via line 88 UPDNEN (up/down adjustment enaWe) and 89 UPDN (up or down) to cause phase 

25 adjustments In stops. The process continues until phase alignment is achieved, and is resumed whenever a 
pres pacified phase error due to frequency drift occurs. 

tn fig. 2a, a detailed block diagram is given for the Compensated Broadband Time Ruler Generator and 
Time Ruler Regulator 30. The function of the Time ruler Generator is more fully described in pending 
application 93304079.2. A plurality of cascaded adjustable delay elements 21' which each have delay of 

so one-half T b are controlled by the up/down shifter SO' on bus 1000. With reference also to Fig.2b, timing 
diagrams, it is seen how the Local Reference Clock LCLK controls the regulation and calibration of the time 
ruler delay elements 21 such that each delay element 21' exhibits the tame lapse equal to half of the bit 
period. 5X frequency multiplication Is assumed in this example. Pulse generator 34 provides pulse 312, 322, 
etc. at node BT1, 301, at every LCLK rising edge 310, 321, etc., to generated the rising edge of the pulse 

as with width equal to half of the bit period. Pulse generator 34 Fig. 2a is responsive to the output on line 33 
the first time ruter pulse which is subsequently delayed. The pulse generator 34 output is BT1. If the delay 
value of each 1/2 Tb delay element 21' is calibrated to its nominal value, at every bit interval, a delayed 
copy of this pulse Is generated at nodes BT2, 302 through BT5, 305. These pulses are signals responsive 
to the time ruler which assist in determining the transmitted and serialized data frequency. The pulse is 

ao "traveling" In the serial delay line composed of several cascading such time ruler delay elements 21', 
generating pulses in every bit interval at nodes Bt1 through BtS and providing frequency information in our 
data recovery process. Error will accumulate as the number of such consecutive bit interval increases. I call 
this time ruler the "bit rate" time ruler, since it is produced from a "byte rate" reference frequency clock 
signal LCLK and which provides time measurement at bit resolution. If infinitely small delay adjustment 

45 could be achieved in the delay regulation process, or if N such bit rate time ruler lapses equaled exactly 
one byte rate time ruler from which they were generated, then the bit rate ruler could be used a large 
number of times. However, N times Tb is not equal to LCLK period due to the achievable adjustable timing 
resolution. In serial link data transmission and recovery, we use the recovered signal or time ruler to 
"measure" the time lapse in the absence of data transitions. If each time ruler had a 1% timing error, a one 

so bit sEp would occur after 100 bits. This is especially a problem in an all digital data recovery approach 
since only step or finite time or delay adjustment is available. However, the "byte rate" time ruler is very 
accurate and is not dependent on the adjustment resolution. The conventional accuracy of a crystal 
frequency source permits the use of the time measurement for hundreds of consecutive byte cycles without 
accumulating significant out of spec timing error. With reference to Fig. 2b, the signals from BT1, 301 

66 through BT5. 305 are combined in a logic OR circuit to produce a signal BtTCK 16. 1 call the ORed output a 
"Compensated Broadband" time ruler, BITCK 16. I call it "compensated broadband" because it is a 
combination of two types of time rulers with different base frequency. Note from Rg 2b, that the broadband 
time ruler BITCK 16 acts like a bit rate clock, but is different from a regular bit Clock. In the timing diagram, 

5 
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the cycles of BITCK are 5 in a group. Cycle Tl through T4 are identical in time because they are generated 
from delaying a signal by identical bit delay time ruler elements, white cycle T5 may be different than the 
othBr four if a timing error of the time ruler exists. In a conventional dock, Period T5 would end at the time 
Indicated by time line 323 if It was determined by delaying Signal BT4 by a time ruler delay element. 

s Actually, the rising edge of the 8th bit 333 is generated by edge 322, which is generated by the rising edge 
of the LCLK 31 at 321 . There is a time gap between 323 and 333. which In fact is the accumulated timing 
error of the time rulers generated in period T1 and through T4. In terms of the time "measurement" of this 
example, Wl time rulers are used for determining 4 bits within every 5 consecutive bits, while the byte rate 
time ruler is used to determine the rising transaction for the 6th bit As a result, the frequency accuracy is 

10 only limited by the byte rate dock LCLK. With a given frequency offset specification of 1 00 ppm or so, data 
recovery using this type of compensated broadband time ruler can stand a run length of hundreds of byte 
times or thousands of b'rt period times, i-e. periods without a data transition. 

The compensated broadband time ruler BITCK can be used to sample or recover the serial incoming 
data on Bne 13 only if its Clocking edge can be aligned with the center of the data "eye", show In waveform 

is 365. In Fig. 2b, BITCK 16 has to be delayed or phase shifted in the amount of Tdd to produce a phase 
shifted copy of the broadband time ruler SBiTCK 16', the timing edge (rising edge) of which Is aligned with 
the center of the data eye indicated by the dashed lines 344. When this alignment is accomplished, the 
time between the rising edge of the phase shifted SBITCK and the average data transition position should 
be half of the bit period as shown in time interval 366. H the phase adjusted or aligned SBITCK is used to 

M "sample" or to "capture" the data transition positions, a edge distribution centered at half bit time away 
from the sampling instances indicated by clashed line 365 can be expected. 

Fig. 2C is the block diagram of a DCD precond'rtiorter. The function of the DCD compensator 20' is 
more fully described in pending application 93304077.6. 

25 EDGE DISTRIBUTION SAMPLER (FIG. 3) 

In Rg. 3. one embodiment of the edge distribution sampler block 61 from Fig. 1a and its connection to 
the phase adjustment decision block 82 and to the time ruler delay elements 22, 23 are shown. In this 
embodiment, SBITCK 16' is used as the clocking signal for the latches 810, which inputs are from nodes 25 

30 through 26. Signal arrived at node 27 is half bit time later than mat seen at node 25, and signal arrived at 
node 2d is half bit later than that seen at node 27. Node 27 is named W0 because data edge or transitions 
are expected to be captured around there if the SBITCK is aligned with the center of the data eye or they 
are "in lock". Nodes 26 and 28 are taps from the delay element, the significance of their exact location will 
be subsequently discussed. For the purpose of explanation, I assume that there are taps In the middle of 

as the delay element at this time. These five nodes make up several regions In respect to the sample clocking 
edge of SBITCK. Refer also to Fig. 4a for the regions: 

The inner Region (I) - region 101 where a data transition is captured when it is between the two middle 
nodes 2B and 28; 

The Outer Early Region ( OE ) - region 105 where a data transition is captured when H is between 
40 node 25 and 26; 

The Outer Late Region (OL) - region 106 where a data transition is captured when it is between 28 and 

29; 

The Outer Region (O) - the logical OR combination of OE and OL; 

The Early Region ( E) — region 103 where a data transition is captured when it is between node 25 and 

45 Z7; 

The Late Region (L) - the region where a data transition is captured when it is between node 27 and 

29. 

Exclusive OR (XOR) gates 315 are used to determine if a transition is captured in one of the regions. 
For example, if a transition is in the OE region white the SBITCK is sampling, the sampled binary level at 

so output 805 and 806 would be different such that the XOR output OE 821 is a "1 ". If it is "in lock" or the 
sampling dock SBITCK is well aligned with ifiB data eye, there should be only a single transition captured, 
so that 823 or 824 is "1 " and all other XOR outputs are "0"'S- A "1 " in either 821 or 823 will make XE 825 
a "1", meaning that the transition is captured in the "early" region. A "1 " In either 822 or 824 will make XL 
828 a "1". meaning that a transition is captured in the Tate" region. Assuming a transition Is captured in the 

ss early region so XE Is a "1" while XL is a "0". the docking edge of the signal ISBITCK 16" is gated through 
by one of the AND gates 817. and the "early" counter 831 Is incremented by one while the Tate" counter 
832 remains unchanged due to the fact that XL Is a "0" preventing the clocking signal ISBITCK from 
incrementing the late counter 832. After a number of sampling cycles, the "early" counter 831 and the 

6 
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Tate' counter 832 received ttie number of counts equals to the captured early and late transitions 
respectively. The test for finding and registering the differences of the two (called E-L test, meaning total 
early minus total late in a sampling period) are computed in up/down counter 841 and output to the phase 
adjust decision 82 via bus line 87. Trie same analysis applies to the "I" end "O" region for transitions 

s captured in the "inner" and "outer" region respectively. The test for differences in captured transitions for 
the two (called l-O test) is computed in subtractor 1-0 843 which holds the difference for transitions 
captured in Inner and Outer regions. The subtractor B-S B43 holds the differences for B (in both "early" and 
"late" region or in neither early or late) and S (only a single transition is captured). The test for the 
difference of me two is called B-S test and is also sent the to decision 82 via bus line 85'. When transitions 

io are captured in both "E" and "L" region or none are captured in both region, the outputs of XE 825 and XL 
82B are at the same logic level, so XOR 629 output is a zero, otherwise it is a "1 ". Counters B (for both) 
and S (for single) win be incremented accordingly. The contents of the counters and subtracters are the 
edge distribution information needed for the phase adjust decision block 32 to make adjustment decisions- 
Flg 5a, 5b, 5c through 8a, 8b and 8c are diagrams illustrating the edge distribution analysis for the 

is phase adjust decision process. In Fig, 4a. the phase shifted integrated time ruler signal SBITCK 16' is 
shown and the detection regions are defined as described early. Data waveforms are shown in overlapped 
eye diagram format together with histogram information for the edge distributions for rising edges (402) and 
falling edges (403) respectively are shown as an Increased concentration of lines where the histogram has 
higher concentration. Row (A) Rg. 4a & 4b illustrates the "in lock" situation (or 0 degree offset) when the 

20 rising edge of the SBITCK is aligned with the center of the data eye as indicated by line 401 . Observations 
of the distribution in this case reveals with reference to Fig. 4b: 

1. Almost all the transitions are captured within the "P. 101, region- A few edges in the tails 402' of the 
distribution is within the "0" region. The 1-0 test result shown in column ill should take position 
maximum value. 

2s 2. Equal transitions are captured in "E" and "L" region because of the symmetry and the alignment of 
the distribution. E-L test result shown in column 112 should be zero. 

3. Only a single transition can be captured in each sampling, so B-S test In Column 113 should take the 
negative maximum value. 

4. Rising edge distribution represented by the cross hatched curve region 402 is centered at the early 
so ("E") region, so RE (rising early) is much greater than RL (rising late), while FE (falling) i3 much smaller 

than FL (falling late) since the falling edge distribution curve 403 is centered at the late ("L") region. 
The same analysis technique is illustrated by the situation when SBITCK is 90 degree out of phase in 
row (B), 180 degree out of phase in row (C), and -80 (270) degree out of phase in row (D), each in respect 
to the center of the data eye. Note that if It Is 160 degree out of phase, the result of the E-L test is the 

3s same as if it is aligned- Thus this test can not be used to distinguish between 0 degree or 180 degree out 
off phase. However, the l-O number is maximum with a change in sign, and will be a reliable test to 
distinguish the two. It also noted that the result of the l-O test in column 111 Is the same for the event of 80 
degree or -90 degree out of phase situations, thus can not be used to differentiate these two states. This 
information is needed to determine the direction in up/down type of phase adjustment. However, &L test in 

eo column 112 has the maximum opposing number for these two, so the E-L test is obviously the test to 
extricate if the SBITCK is leading or lagging the data. Rom the above analysis, it can be seen that test E-L 
or test 1-0 are a function of the SBITCK phase offset, and the two functions are 90 degrees out of phase- 
In Rg. 5 through 9, variations in width of the "I" region as well as the peak-to-peak value of the 
captured data transition distributions are considered. Fig. 5a, Sb and 5c illustrate the l-O test results when 

4s the width of the "I" region is large, medium and small respectively. The width of the T regions are shown 
in shaded areas. Since the t-O test is primarily for differentiating 0 degree offset and 180 degree offset, 
targe T region width represented by K) curve 851 in Fig. 5a has a greater slope near 180 degrees and is 
much more appropriate for the task than the small "I" region width represented by curve 553 in Rg. 5a Rg. 
8a, 6b and 6c are for the seme situations except that the total data edge distributions are depicted in 

so standard Gaussian Distribution with a medium peak-to-peak value of about + - 30% of the bit period. The 
same conclusion can be reached that large "I" region width should be used. Rg. 7a through 7c show the 
situations when the peak-to- peak value of the total edge distribution is large and Rg. 8 shows the situation 
when the peak-to-peak value of the data edge distribution is small. Summarizing, for all cases, a large "I" 
region width will make the distinction between no offset and 180 degree offset easier. Rg. 8a through 8c are 

55 for the situation when the peak-to-peak value of the edge distribution is small and the same conclusion can 
be reached. Rg. 9a - 9e summarizes the above discussion for different test and peak-to-peak value of the 
edge distributions. 
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Fig. 9a ■ 9e shows edge distributions for three different peak-to-peak values. Curve A. B and C are for 
edge distributions with small, medium and large peak-to-peak values respectively. In Fig. 9b, the result of 
the E-L test as a function of the SBITCK offset In respBct to the mean of the data edge distribution is 
plotted for the above three oases respectively. This tast E-L is also used to determine if the circuit is "in 

s lock". An absolute threshold number d can be specified which is the maximum number allowed for the 
differences between the number of the "early" edges and the number of the "late" edges in a sampling 
period. If the difference is below d, then no adjustment needs to be performed. When the absolute value of 
E-L exceeds d, and "up" or "down" phase shifting of the Integrated time ruler is enabled- The up/down shift 
direction depends on if E-L has positive or negative value. Note also In the figure that the phase 

io alignments for pA, pB and pC are different for the same threshold value d because of different data 
transition distributions. Data edge distribution with smaller peak-to-peak value as curve A has less alignment 
error, pA. Digital filter functions may also be included to reject the effect from temporary phase change 
caused by noise. However, this is accomplished mainly by choosing optimized sampling size which will be 
discussed later. In Fig. 9c, the result of 1-0 test as a function of the phase offset is plotted for the three 

is types of edge distributions. As mentioned before, when the absolute value of E-L is below threshold d, the 
phase relation of the SBITCK and the data Is either dose to 0 degree or In the neighborhood of 180 degree. 
The l-O test is primarily tor differentiating the two, which is not a difficult task since the l-O test has 
opposite maximum value in those two events. Fig. 9d shows plots of the results from B-S test, assuming 
that data has transitions for every bit and there is dominant duty cycle distortion jitter associated with the 

to data. The B-S test does not provide more information regarding the phase alignment than that provided by 
the i-O test It may be used as a double-checking condition- 
Fig. 10, shows the truth table for the various conditions and the corresponding decision control signals 
output of the Phase Adjust Decision block 82, Fig. 1a, are summarized. The E-L, K), and B-S tests have 
predetermined characteristics in each quadrature which can be used to make phase adjust decisions. This 

ss condition differs from ttie analog PLL-based systems, In which the phase error signal from a phase detector 
results immediately in a response In frequency adjustment of the VCO. Low pass filter for smoothing trie 
error signal is necessary In such analog PLL loops to reject high frequency components in the phase error 
signal caused by noise, while retaining the low frequency phase variation information representing real 
frequency or phase drifting, in this digital approach, however, the edge averaging mechanism has inherently 

20 low pass filtering functionality, since phase adjust decision command is not generated based on phase error 
of an individual edge, but rather on an averaged edge position or the average of the phase errors for many 
bits. The decision logic for implementing functions in the table can be relatively simple. Far example, if 
each digitally controlled adjustable phase shift step is very small (delay of a minimum delay unit), a pure 
combinatorial logic may just be enough to map the E-L, 1-0 test outputs condition as the inputs to phase 

35 adjust control outputs UPDNEN and UPDN. Since the amount of phase adjustment allowed for each step is 
small, regardless of the phase error magnitude detected, another type of low pass filtering is embedded in 
the scheme. Additional digital filter functions can also be implemented. For example, not only the values of 
the tests, but also the rate of the change of values can also be counted for making the adjust decision. 
Furthermore, variable value adjustment steps may also be used depending upon the magnitude of the 

40 phase error detected, and on the quadrant of the phase offset A "FS" output signal (Fast/Stow mode 
switch) can also be generated: if the phase difference detected is more than +-90 degrees, or it falls into 
02 or 03, larger phase adjustment step or mutti steps In one adjustment may be used. This is useful during 
the initial lock-in period in which fast adjustment is often needed. 

Fig. 1 1 Is an alternative embodiment of the diagram for the edge distribution sampler which provides a 

45 different way to determine the "in lock" condition. Signals at nodes 25 through 29 are further delayed by 
two unit delays to provide two more sets of sampling points 325 through 329 and 425 through 429. The 
SBITCK 16' is also delayed by a unit delay, so sampling points 325 through 329 is the center group, 
sampling points 25 through 29 becomes the left group and sampling points 425 through 429 is the right 
sampling group. The edge distribution sampling takes place for all three groups and the phasB adjustment 

so process can be paused or the circuit considered to be "in lock" when the phase offset is detected near 
zero for the center group, positive (data is leading the SBITCK) for left group, and negative (data is lagging 
the SBITCK) for the right group. 

BIT SERiAUZER (FIG. 12) 

es 

With reference to Fig- 12. the circuit for the bit seriafizer 30 of Fig. 22 is disclosed which serializes the 
parallel data input and which provides the "Compensated" clock called BITCK. Parallel data TDATA (4:0) 
are latched into the input register and are valid at TD1 through TD5 at the rising edge of DL before the 
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rising edge Of BT1. TD1 tfirough T05 are complements of TDATA (4:0). Nod© 33 at tha input of invertor 33" 
is pulled high by a weak-drive device 35 if none of the MOS switch SW1 - SW5 are on. When the pulse of 
BT1 arrives, it turns on MOS switch SW1 for the duration of 1/2 Tb. Since output TD1, to which SW1 is 
connected is low, MTR or node 33 is pulled low for 1/2 Tb. This causes T-FF 32' to produce an output 
6 transition of SERO at 12. A half bit later BT1 returns to low, turns SW1 off. causing XTF» on node 34 to 
return to zero. This sequence is repeated as a gated readout of the register. If one of the bits in the input 
register is zero, no transition will occur at output SERO for that bit This operation is more fully described in 
my earlier copending application, entitled "Digital Data Sertalteer and Time Delay Regulator", Serial Number 
07/801.312 filed June 19. 1932. AMD Docket A885- 

to 

DELAY UNITS (Fig. 13a) 

The delay line 21 of Fig. 2a which provides the time ruler signals BT1-BT5 is made up of a plurality of 
cascaded adjustable delay units 21 each of which nominally provide a delay of 1/2 Tb. Rg. 13a shows that 

»5 the adjustable delay unit 2V Is preferable made up of two parts: a course step unit 211 and a fine step unit 
212. The course step unit 211 Is formed by two parts: a fixed-stage delay 221 and a variable-stage delay 
222. There are k stages of delay cell 250 (di tJirough dk) In the fixed-stage delay 221 and m stages of 
delay cell 250 for the variable-stags delay 222. One of tha embodiments of delay cell 250 is shown In Fig. 
13b. which employs 3 inverters. The delay from IN to OUT is defined as T S iop. while the delay from IN to 

so DOUT is defined T^,. The delay from the Input of the delay cell DV1 (223) to the MUX 225 output DLYM of 
the variable-stage delay 222 depends on the digital command code on fine 232, which selects only one 
path of MUX 235 from MX1 through MXM to DLYM. Assuming that delays in MUX 225 from MX1 through 
MXM to the output of the MUX 225 are all the same (Jmod. and the digital command code selects the path 
of MXI to DLYM, or ff i delay stages are selected in the variable stage delay 222, Hie total delay of the 

25 course step unit 21 1 can then be obtained: 

+ «cw 

ao = T mt + '0",^ 0=1, m) 

where T^, i3 the "fixed" part of the delay unit, or its intrinsic delay. The value of parameter k and m are 
chosen to guarantee that under all processing and operating conditions that the unit delay element 

35 adjustable range is large enough to be able to produce the nominal delay required (i/2Tb). T nttp , the delay 
of the delay cell 250, Is the base delay or the minimum adjustable delay of the operation which determines 
the resolution of the operation. T lrrt , is the Intrinsic delay of the delay unit which wBl determine the up-iimrt 
of the operating frequency. The delay unit 21 could be built by just using the "course" step unit itself if 
both Tggp and T^, are satisfactory for the frequency targeted. The addition of a fine step unit 212 can be 

40 beneficial for both higher frequency operation and for lower frequency or larger frequency range operation. 
For the former, a even finer adjustment than T^ is achieved in the fine step unit 212 to improve the 
resolution to reduce the timing error. For the later case, T 8tep can be chosen a larger value (1 Ox for 
example). Fine step unit 21 2 can use exactly the same structure as the course step unit 21 1 with its base 
delay cell delay being a finer value (one 10th of T^ for example), or rt can be built with a different 

45 structure. In any case, ft Is the digital command code on bus 232 that also controls the delay of the fine 
step delay unit 212. Assume tor this example that a coarse step adjustment is applied to each delay 
element If a total of 10 (2N =2x5 = 10) coarse step adjustments are applied to the delay unit at the same 
time, a total delay range adjustment of lOxT^p will result for the whole delay line. Another requirement for 
the delay units Is a matched signal rise-to-rise and fall-to-fall delay to ensure a non-distorted pulse width 

60 while a pulse Is traveling through the delay line. Inverter 213 is used for inverting the polarity Of the Signal 
from DLYIN 210 to DLYOUT 220, so for every other delay unit (or a bit time delay) the signal is not inverted 
and that any mismatches in the rise and fall delays inside the delay unit will be canceled. 

With reference to Rg. 2a. the digital command from the Up/Down Shifter 00* on line 1000 controls each 
of the delay units 21 and 21' such that the delay of each equals to half of the bit period (1/2 Tb). accurate 

ss to * one step in the delay unit, provided that all the delay units are identical and that proper codes are 
chosen at any particular instance. 

With reference to Fig. 2a and Fig. 14a the delay tine 21 detection is performed by "sampling" the level 
status of the delay units by a registration process which looks at each monitored node Ol delay units from 

9 
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BT3 through delay units 501 through 506 of phase detector and of post delay 60'. The monitored node is 
chosen in such a way that the time delay between any two consecutive nodes in any operative condition 
should be less than 1/2 Tb. Sampling Registers 70' samples the levels of each monitored node at the rising 
edge of the LCLK to locate the position of the previous DTRCK rising edge. Alternatively, the task of 
5 locating the position of the previous rising edge could also be accomplished by sequentially setting 
corresponding RS Rip-Flops which are connected to each of the monitored nodes upon the arrival of the 
pulse at each monitored node. At the occurrence of the DTRCK rising edge, the RS Flip-Rop status can be 
interrogated to determine the position of the previous rising edge by locating the border between the set 
and non-set RS Rip-Flops. 

70 With respect to the embodiment of Fig. 14a. if the delay of the delay units is very close to the nominal 
value of 1 12 Tb, when a rising edge of DTRCK arrives at input to Delay Compensation 52', the previous 
TRCK rising edge should be somewhere around or near the nodes at the interface of delay line 21 and 
phase detection delay 60', having been delayed the full byte clock period or the delay amount of 2N delay 
units by the delay line 21. A level change indicating a rising edge position Is observed in this region when 

Tfi the levels of the monitored nodes are latched into the D type Flip-Flop registers clocked by DTRCK. This 
region Is where the timing or phase difference between the sampling clock DTRCK on line 14 and the 
various delayed copies of the sampling clock DN(j:l) needs a dose examination, and it is called the 
detection window. Assuming the command code to the Delay units is set correctly such that the delay of 
the delay units are exactly 1/2 Tb, we define node WCN, Fig. 14a and Fig. 14c on the output of delay 503 

so in the phase detection delay 60 to be the "window center node" where the transition is to occur if the 
delays are correct when sampling takes place, if the delay of the delay units 21 is not exactly 1/2 Tb, then 
the transition will be detected on either the left side (indicating "early") or on the right side (indicating 
"late") of the window center. 

The same type of delay cells usBd in delay unit 21 is also used in the phase detection delay 60' (501 

25 through 508). 

We group the inputs to the sampling register 70' from different nodes of the delay line 21 , from Phase 
Detection Delay and Post Delay 60' in a bus signal and call it DN(j:1). The outputs from the sampling 
register 70' are the sampled level status for the above nodes and are called DSTQ:1). We can show that a 
B ...ill..m000..." type of this status code can be expected near the node where a "0 -> i" transition is 

so taking place when it is sampling. A "...001 110000—" type of status code is seen if the signal pulse width is 
relatively narrow in respect to the delay amount of the delay line. Note that if the time delay interval 
between any two adjacent nodes is smaller than the pulse width in any operation conditions, the pulse will 
be "buried" between the two nodes and the transitions of the pulse will not be seen. The resulting status 
code in this case will be "...00000...". Assuming that the command codes are proper, the WCN is the node 

35 around which a level change or transition occurs when the sampling register 70" latches the level status into 
the DFPs. In the event that temperature change or supply power noise causes delay variations so that the 
delay of the delay units 21 is no longer equal to half of the bit period, or the total delay of delay line 21 is 
no longer equal to the byte period, the transition detected win be on one side of the WCN, giving 
indications whether the delay of the delay unit 21 ' is shorter or longer than the nominal value. 

40 Fig. 14c shows the waveforms at the input of the sampling register 70" Fig. 14a, and the status is 
sampled for different timing conditions. Line 531 shows the status when the delay units are "calibrated" to 
their nominal value or the delay line Is "In-tock." Line S32 shows the status when the sampling dock 
DTRCK is "early," which is equivalent to when the delay of the delay units is larger than their nominal value 
such that the previous rising transition or the delayed copy 561 has not arrived at WCN. Line 533 is for the 

-ra case the sampling clock is "late" so that the transition 561 has passed WCN. The sampled status codes are 
shown in the matrix of Fig. I4d. In the "in-tock" condition column, the transition or level status change 
occurs within -0.5 and + 0.5 of WC delay unit range (552). In the "early" condition column, the transition 

occurred between node DD_2 and DD_1 (indicated by a level difference between SD 2 and SD 1, 533) 

when the sampling clock is sampling. This Indicates that the delay of the delay unit is larger than the 

eo nominal value and that the control should decrease the delay of the delay unit In the "rate" condition 

column, the transition occurred between node DU 1 and DU 2, indicated by a level change 553 between 

SU 1 and SU 2 when it is sampled. This condition should cause the control to step up to Increase the 

dBlay value of the delay unit to move the transition back toward the WCN. If the delay Of the delay units is 
very dose to its nominal value of 1/2 Tb, there is a strong probability that signal at WCN will be making a 

55 transition when DTRCK is sampling, 561, Fig. 14c, causing the flip-flop to enter the metastable state. 
However, since the outputs from the flip-flops DSTQ:1) are sampled ones every byte, it can allow several bit 
period times for metastable flip-flop resolution, adequate for the output to be settled at one Of the two logic 
states. The addition of the phase detection delay 60' to the delay line 20 is to provide a more detailed look 
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at the window center position. The additional delay caused by delay units 501 through 503. as well as the 
setup time of the D flip-flops in sampling register 70, is to be compensated by the delay compensation 
block 52'. 

The calibration of the delay unit in operation, includes two modes, the initial "power-up" mode and the 
5 norma] operation mode. These modes are more fully described in copending application 93304079.2. 

Fig. 14b is the block diagram of one embodiment of the Ruler Adjust Decision Circuit 80* (Fig. 2a). The 
inputs to this circuit is the status code outputs from the sampling register DST(j:1) which are provided by 
the sampling process once every byte cycle. Complex gate 84" performs the function of rising transition 
detection. Only if the two inputs to the XOH gate 82 are in different logic levels and the one on the left Is a 

to logical one, then the output of the AND gate 83 will be high, indicating a "0 -> 1 " transition or a rising edge 
transition detected between the two consecutive nodes 801 and 802. There should be only a single "1 " in 
the XDST Q:1) input bus lines for any sampled code since there is only a single rising edge in the delay line 
21 per sampling eyde, if the delay units are calibrated to or dose to nominal value of 1/2 Tb such that the 
whole delay fine 21 delays a time equal to the byte period. If more than one "1" exists In XDST(j:1). a 

is "noise" condition, or a 'locks d-on2x" condition (meaning that the delay value of the delay units are close to 
Tb instead of 1/2 Tb) is indicated. The Status Decode Logic 85' decodes the status code XDST(j:i) Into a 7- 
line detection code on fine 851 through 857. 

Hg. 15a and 15b illustrates the function of the Status Decode Logic 71. Line 800, Fig. 15a indicates the 
"window center" position. If the detected transition is within the region of -0.5 and +0.5 delay cell from the 

20 window center or In the zone D, E. XWC is "1 ". If the detected transition is in the region of one delay cell 
on left (right) side of the window center, then XDl (XUl) is logic "1 ". If the detected transition is within three 
delay cells on left (right) of the window center, XD3 (XU3) becomes logic 1". If the transition is in the 
region one delay cells away on left side of the window center XD (XU) is a logic "1 ". Note the overlaps of 
the regions. The status of this detection code combination gives the information as where the transition is 

2s detected and how fast this detected transition is moving from byte cycle to cycle. Both are useful when 
implementing a digital filter function. The Adjust Update Control block 72', Fig. 14b, is not 3 pure traditional 
combinatorial logic circuit because the output of this circuit is dependent on the current input code 
combination, and on the history of the input/output Also, in high speed data communications, the byte clock 
rate is in the range of ten's of MHz. As a result, the status sampling rate is much higher than the delay 

so variation rate caused by normal temperature drift or supply voltage variation. It is necessary to "filter" out 
the digital "noise" In the status codes, and to ignore the temporary delay changes caused by high 
frequency power supply noise. Different digital structures can be used to implement file filter function. 
Generally, the filter function should remove or ignore rapid phase variations caused by rapid electrical 
noise, while passing the slow and steady phase changes caused by slow delay changes to the controls of 

x the Up/down Shifter 90 to make proper command code adjustment to maintain constant delays of the delay 
units. 

Fig. 15b shows a function table for Implementing the Adjust Update Control 72, Fig. 14b. Columns 90i 
through 907 are the status code inputs from the Status Decode Logic which is sampled for every byte 
clock. Columns 808 through 910 are three intermediate variables. Outputs are shown in column 911 through 

to 914 where UD and ROTEN are the control outputs to the Up/down Shifter for delay adjustment R0TEN = 
1 enables the adjustment when control update is necessary. UD gives direction indication as how to make 
adjustment (UD = 1 for step up and UD=0 for step down) to increment or decrement the delay of the delay 
unit FS is the switching control between Fast adjust mode and Slow adjust mode. RESTART = 1 enables a 
reset process in the event of a consistently detected error status. 

45 With reference to 15b, row (a) shows the situation when the detected transition is in zone H, Fig. 15a 
right of XU3. as in the case of power-up. This is the zone for inc-dly-fast mode (UD - 1 and FS - 1). 
meaning delay of the delay unit needs to be incremented with large (coarse) step to be increased fast This 
fast mode corresponds to a smaller loop time constant as in the PtL case, in row (b) and (c). the detected 
transition is still on the right side of the window center, so incrementing the delay unit is still necessary. 

so However, assuming the transition is already within + -3 delay units, or + - 3xT 4 ^, from the window center, 
In zone 3 in Fig. 15b, no fast adjustment is to be made in this instance since each such adjustment of a 
delay change, of 5xTstep Is too large and may "overshoot" to zone A on other side. This is the zone for 
INC-DLY-SLOW mode (UD = 1 and FS = 0). meaning delay unit incrementing is made with fine steps which 
correspond to a large loop time constant as In the PLL case. Row (d) and (o) Indicates the IN-LOCK 

55 condition, which means that the total delay of the delay line is within +- 0.5 T^ comparing with the 
reference byte clock. No adjustment is necessary for this condition (ROTEN =0). Row3 (f). (g) and (h) 
indicate similar situations as in rows (a), (b) and (c) except that the delay of the delay unit needs to be 
decremented (UD = 0) instead. Fig. 15a provides a means to understand the overall operation of the adjust 
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update control 72 relative to different zones in which the transition i3 detected. Row (i) and (j) shows error 
status in which either transitions are detected on both sides of the window center, or none is detected- 
These error status could be the result of digital noise in the status sampling process, and should therefore 
be ignored. The RESTART becomes active onty if the errors are persistently detected- Another error status 

5 could result if the adjustable range of the delay unit is large enough that it delays time Interval of Tb instead 
of 1/2 Tb. A IN-LOCK condition could be erroneously resulted sine© one transition should be detected in the 
middle of the delay line 20, while a second transition can be detected around the window center. This is 
indicated in row (k) of Fig. 1 5b as a LOCKE D-ON-2* condition, similar to the case In PLL when the PLL is 
locked on the second harmonics of the reference dock. One solution to this problem Is to restart the delay 

ro calibration process by setting the control code to the delay units in such a way that all the delay units select 
minimum delay value as the initial setting. In doing this, the transition is first detected on the right side of 
the window center to avoid the LOCKE D-ON-2x condition. Note the "*" in the table in Fig. 15b, which means 
the controls become activated only if a certain condition is satisfied. For example, if the last value for LID is 
"0" (meaning delay decrement) while the current status code changes to XDN =0 and XUP = 1 (meaning 

rs delay increment), one option is to change UD immediately to "1 " If It Is In the slow or fine adjustment 
mode, since only a small step adjustment can be made regardless of how big the delay adjustment is 
indicated by the detection, just like a fast spike on the phase detector output is "smoothed* by a large 
capacitor In the low pass fitter. On the other hand, if it is in fast adjustment mode white a sudden change in 
direction is indicated, further detection results may be needed before a reverse direction adjustment Is 

20 made. In any case, these type of digital filter functions can be implemented to make correct delay 
adjustment with less timing error or resulting jitter. 

DIGITAL PHASE SHIFTER (FIG. la, FIG. 16-21) 

26 The digital phase shifter of my Invention, as indicated above, must act as if it were an infinitely long 
delay line to accommodate the fact that there may be a slight and constant difference between the 
frequency of the local clock of the receiver and frequency of the transmitter. 

The phase shifter utilizes a tapped delay line and a MUX to form a digitally adjustable delay to provide 
phase shifted copies of an input signal. An up/down control signal controls the Increment or decrement 

at> selection of the delay so that a positively or negatively phase-shifted copy of the input signal is selected 
each time the selection is enabled. The digitally adjustable delay is able to provide phase shifting in 360 
degree range in all operating conditions. An on-the-fly Phase Shift Range Calibrator detects the delay 
variation due to temperature or power supply variation and determines the output tap known as the End 
Stage Position (ESP) In the digitally controlled delay from which the input signal is delayed exactly 360 

as degree. In the event that the up/down control indicates that a constant positive (up) phase shift is needed, 
the Rotating Phase Shift Control moves the delay path selection Dne tap up every enabled dock until the 
ESP is reached. If a further up-shift is stall indicated, the phase shot control will "wrap around" to select the 
1st tap as the delay path such that a positive phase shift step is made in relative to the last selection where 
the control selects the tap output stage indicated by the phase range calibrator as the ESP. The same 

w operation is performed when the up/down control indicates that a negative (down) phase shift is needed- 
The rotating phase shifting requires only an up/down and an enable signal from the phase detection and 
adjust decision logic. The digitally controlled or selected delay is adjusted one step at a time and, by virtue 
of its construction, is designed to be able to "freeze", or hold, a position in case there is no data to 
compare. The previous phase adjustments are also able to be "memorized" by the control logic which is 

45 used to send the up/down control signal, such that certain adjustment "patterns" or some modulating 
frequencies from noise sources may be recognized and filtered by the control logic. 

With reference to Fig. 1b the digitally adjustable delay 50 receives BJTCK 16 and performs phase- 
shifting on it providing the output from SBiTCK 16, which is the phase-shifted copy Of the original Signal 
BfTCK 16, as shown also in Fig. 2b. Waveform SBITCK 16* is a phase shifted copy of waveform 16 by any 

so desired amount The phase shift is controlled by the parallel digital control code 68 from the Rotating Phase 
Shift Control 60. Rotating Phase Shift Control 60 only requires a signal UPDN 89 and UPDNEN (up/down 
enable) 68 from the circuit which informs this circuit to provide positively increasing phase shift (UP) or less 
or negatively increase phase shift (DOWN) form the current phase shift amount UPON 89 Is "1 " when 
further positive phase shift is needed, and Is "0" when negative or lass phase shift is desired. UPDNEN 88 

88 Is the adjustment enabling signal. UPDNEN is a "i" rf the adjustment needs to be performed upon the 
arrival of the dock rising edge, and is a "0" if current phase shift is to be maintained frozen. Phase Shift 
Range Calibrator 70, Fig. 18. constantly monitors on-the-fly the delay change due to variations In operating 
conditions, and sends a parallel digftal code 79 to the rotating phase shift control 60 to notify the position Of 

12 
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the stage in the delay line where the original signal is delayed a full cycte or 360 degrees. RDY 36 are the 
signals from other circuitry to signify that the calibration is ready to be performed, res 32 is the reset 
signal for resetting the controls during the power-up sequence or in other occasions when a re-start is 
necessary. IRG 361 is the In-Range signal when the phase comparator 73 finds signal PHO 711 and PHK 
s 712 are almost phase aligned. Fig. 18a. 

Fig. 18 shows one embodiment of the Digitally Adjusted Delay which consists of a series of cascaded 
inverters 509 and MUX 541 or other fast path selector. The input to the MUX are the taps from the delay 
chain for every other inverters via lines 51 1, 512, etc. for a total of N pairs of inverters. The control inputs to 
the MUX, S(1) through S{N) 68, are outputs from the Rotating Phase Shift Control 60, Fig, 19 and Fig. 1a 
io which are ail "0's" except one "1 " which selects a corresponding delay path. The control inputs S{Q), which 
would select input on line 510 if it is a "1 is tied to "0" to disable the selection, tf stage DK. Fig. 16, is the 
ESP, i.e., Is found to be the stage at which signal BfTCK is delayed or phase shifted a full 380 degrees, the 
delayed signal at the output of the stage DK on line 515 should be coincided with that at the input on One 
16. This can be better seen In Fig. 17, the timing diagram of the delay, where the output signal SBTTCK(K) 

is of 515 is coincident with signal BITCK 16 on line 16- A next positive phase shift step can be achieved if the 
selection moves to stage 1, shown as SBITCK(l) 51 1, where a phase shift 411 equal to the delay of two 
inverters is shown. It Is seen that an endless positive phase shift can be achieved if the digital control 68, 
Fig. 16 steps through from S(1) to S<k) then rotates or "wraps around" to S(1) and so on, where S(K) 
corresponds to the 360 degree phase shift end stags or the ESP- in an integrated circuit format the delay 

20 of the inverters, in the Digitally Adjustable Delay 50 varies with process, environment and power supply. For 
a given cycle time or 360 degree phase shift, K could be a different number due to above factors. To 
acquire accurate 360 degree phase shift rotation, the delay of the inverters have to be constantly detected 
so an accurate ESP location can be maintained for calibration. 

The block diagram of the Phase Shift Range Calibrator 70, Fig. 1b, is shown in Fig. 18. Two digitally 

sb adjusted delay's 50A and 50B, constructed exactly the same way as that of 50, Fig. 16, are used to 
calibrate the delays to find the ESP location. The controls to delay 50A are set such that it selects the first 
path of the delay by connecting S(Q) to "1" while the rest to "0", thus acting as a "dummy" delay. The 
selection for delay 50B is controlled by parallel digital code 79 from the delay adjust up/down shifter 75 
which Is in turn controlled by the up/down signal UD 77 and adjust enable signal UDEN 78 from the phase 

30 comparator 74. Controls S(J) through S(N) determines the calibration rotation based on the assumption that 
at least J stages are required to delay a full cycle for a given signal in any process or operating conditions. 
At initial power-up, the control may be reset to a p re-specified code to select a delay path, which is often 
not the correct path to delay the full cycle or 360 degree of the signal. The phase difference of delayed 
signal at PHO 711 and PHK 712 is detected by the phase comparator, and an UD signal is generated to 

as signify the delay adjust control up/down shifter to add or subtract delay stages to reduce the phase 
difference. If. after a number of adjustments, signal delayed by delay 5 OA at 71 1 and signal delayed by 50B 
at 712 are in phase, the delay path of delay 508 is the path that provides a 360 degree phase shift o the 
original signal delayed by 50 A. The code 78 for selecting thie 360 degree delay path contains a single "1 ". 
the position of which defines the ESP location, or stage DK corresponding to C(K). The difference of the 

4a delay path in 50A and 50B is the delay of pure K delay units consisting of 2K inserters, since the other path 
delay or the "intrinsic" delays for the two are the same and should be concealed out The output code 79 
(C(J) through C(N)) contains the ESP information and is used to inform the rotating phase shift control 60, 
Fig. lb of the ESP location. The phase comparator circuit 73, Fig. 18B could be any phase detector circuit 
which detects the phase difference of signal PHO at 711 and PHK at 712, and which ensures that correct 

45 delay adjust control signal UD77 and UDEN 78 are generated such that phase alignment of PHO 711 and 
PHK 712 is eventually achieved and that the phase difference is 380" or the delay difference is a Ml cycte 
of signal BITCK Fig. 18B is a preferred embodiment of the phase comparator 73 for providing the delay 
adjust up-down control signal UD 77 and UDEN 78. The phase comparator circuit 73 is more fully 
described in the simultaneously filed related application, "Digital Variable In- Lock Range Phase Compara- 

so tor", which has been incorporated herein by reference. 

Fig. 19 is a detailed block diagram of the Rotating Phase Shift Control 60, Fig. 1b. ft contains flip-flop 
register stages ST(1) through ST{N) to form the control of a bi-directional, 360 degree phase range rotating 
shifter, where each stags controls the selection of its corresponding delay path in the digital adjustable 
delay 50, Fig. 1b. tf its output is a "1 ". the delay path or the corresponding MUX in digital adjust delay 50 is 

ss selected. After the 360 degree phase shift range calibration, a bus having code 79 is generated from the 
Phase Shift Range Calibrator 70, Fig. lb. as described above. The code is shown in box 601 where the 
location of the "1 " is indicated by an arrow 603. In the example shown, stage ST(K) corresponds to the 
ESP. Positive or "up" incremental phase shift adjustments are executed in a rotational sequence indicated 

13 
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by loop 604. A "1" (indicated In the output code shown in box 602) is shifted to the right at each enable 
clock CK. This continues until the "1 " is shifted to the ESP stage ST(K) such that S(K) becomes a "1". This 
means that the signal has been phase- shifted for 360 degrees by this code selection. A further positive 
phase shift request from the decision circuit should cause the "1" to be wrapped around to stage 1, or S(1) 
5 will become a "1 and then right-shift again. Note that this rotating shifting takes place in the range of ST- 
(1) through ST(K), as the stage ST(K+1) Is "Hocked" by the ESP. In practice, when the "1" is shifting 
While the Stage ST(i) is away from the ESP, the detection or the exact location of ESP Is not Important, and 
the calibration to find the ESP needs only to be made in relatively long time interval, or when the "1" is 
approaching the ESP. The error of the 360 degree range calibration due to limited adjustment resolution will 

70 cause an uneven adjustment step at the "wrap around", and results in timing jitter on the "phase aligned'' 
or "recovered" clock in terms of data recovery. Negative or "down" dec re mental phase shift adjustments 
are executed In opposite direction. Note in Rg. 19, that stages ST(1) through ST(j-i) 62 can be simpler in 
structure than those of stage ST(J) through ST(N) 63. because there is no "wrap around" needed for those 
stages. As explained previously, in any operation conditions, there is always some minimum number of 

T6 states required to delay a full cycle of a given frequency signal. Apparently also, the rotation for finding the 
ESP location needs only be performed for stages ST{J) through ST(N). The dock SBfTCK 16' for shift 
operation Is gated by UPDNEN (S. Proper delays 61 are included to ensure that the signal transitions are 
completed from the stages where the delay path is switching to maintain the signal integrity. Line 60S feeds 
back the output of S(1) to the ESP stage selected in the event that a "down" or left shift is needed while the 

20 "1 " has been left-shifted in stage ST(i). A "down wrap around" will have to send this "1 " back to the stage 
where the ESP is located. In normal operations, when a "0" is in ST(1) while a down shift Is requested, the 
"0" is also wrapped around to the ESP stage via the same One 60S. Outputs WA(I) of all the register stages 
63 are ORed into signal WA or as S(0), which is the input to stags ST(1) in the "up" or right-shift 
operations. In any event, only one "1 " is allowed in all the outputs S(1) through S{N) to ensure that a single 

25 phase shifting delay path is enabled. No °1" should be in outputs on right side of the ESP stage ST0Q. 

Fig. 20 shows a preferred embodiment of the register stages in Fig. 18. Register stage 62 is one of the 
simpler register and is used for stages ST(1) through ST(J-1 ). Depending on whether UPON 69 is a "1 ° or a 
"0", MUX 612* selects input from the output on left (previous) stage on line 625', or from the output on right 
(next) on line 611'. When the gated dock CLK 16" arrives, a "up" or "dawn" shift is performed. Register 

so stage 63 is the type of register with combinatorial logic used for stage ST(J) through ST(N), Rg. 19. 
Register stages 63 are the same as register 62 with addition to register function 62 of a logic block 614. It 
takes the output of MUX 612 on line 613 for selecting inputs from either the previous stage S{M) on line 
625, or from the next stage S(M) on line 611, depending on control UPON on One 615- Other inputs to the 
logic are: the inputs 79 from the phase shift range calibrator 70, Fig. 1b and Rg. 19, both for the current 

as stage C<l) on line 617, and for the previous stage on line 616; and input from stage one S(i) on line 60S. 
The logic for Rg. 19 is easier to understand by referring also to Rg. 21. which contains the equations and 
its truth table tor the function. For each register unit STO). there are two outputs: S(l), and WA(i). S(l) 
controls the corresponding input of the digital adjustable delay 50, Rg. 1b WA(l), 608 is the feedback signal 
to stage one ST(1). In the truth table shown, row 701 through 704 is when "up-shift is needed (UPDN - 1), 

-» while row 705 through 708 is when °down°-shift is required. In row 701 and 702, C(l) is a "0", meaning that 
stage ST(I) is not the ESP stage, so normal shift is performed. The status of the previous stage S(M) is 
transferred to the D input of the D flip-flop D(l), and win be latched in S(i) upon the arrival of the clocking 
edge Of CLK 16", while input to Stage ST(t) or WA(I) is always a "0". Row 703 and 704 is for the Situation 
when ST(I) is the ESP stage (I = K, C(l) - 1) or the end stage from which the signal has already been 

4e phase-shifted 360 dagrass. In row 703, if the previous stage is a "O", the "0" is wrapped around to ST(1) 
sincB WAfl) is a "0", while the current stage is the ESP (C(l) =C(K) = 1), the "1 " is not permitted to move 
to stage ST(I + 1), but rather, has to move to the first stage ST(1). A "wrap around" Is performed here by 
making 8(1 + 1) a "0" upon the next clocking edge of CLK 16". This can be seen on row 704'. where the 
current stage is the stags following ESP, since C(H) = 1. During cycle n. if the ESP stage is "1" {S(l-1 )- 

so ( „)=1), for next cycle (n + 1), the current stage will be set to "0" (S(I)t»+i)=0), preventing the "1" from 
shifting beyond ESP. Meanwhile, at the docking edge of CLK 16" on cycle n, both WA0) (n) , and S(I) M have 
been set to "1" (row 704), i.e. S(1) will be set to "1 " upon the arrival of clocking edge of cycle (n + 1), thus 
completing the "wrap-around" "1" of ESP to stage one or S(1). Row 705 through 708 is lor the "down" shift 
operations (UPDN = 0). Row 705 and 706 is similar to the case of row 701 and 702 except that they are 

ss left or down shift operations. Row 707 and 708 Is the sltuaoon when ST(l) is the ESP stage or l = K. This 
time, whatever Is In stage ST(1 ) will be "wrapped around" to ST(l) (l = K). WAjl) is "dont care" since these 
are down-shift operations. The equations for S(l) and WAfJ) are shown on top of the truth table. At power-up. 
a reset is mads such that all stagBs are "0" except ST{1) which is a "1", so a minimum phase shift Is 
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chosen. Meanwhile, the phase range calibrator resets C(J) to "1 * while others to "0". so the stage J Is the 
ESP to start. The ESP is expected to move "up" to its correct position after the phase range calibrator 
completes certain number of calibration cycles, at which a IN- RANGE signal IRQ 381 is generated, and is 
sent to the Rotating phase shift control block 60, Rg. 1A, to Initiate phase adjustment process- The UPDN 
5 signal for the delay path selection should choose to up-sWft at the start, so a "1" at outputs Sfl) is moving 
"up". In any case, no "1" is allowed to be at the outputs beyond the ESP. Equation (A) is for stage 1 
through stage J-1. while the equation (6) and (C) are for stage J through stage N. Circuit implementation 
can be easily achieved by using conventional combinational circuit design methods or by using a logic or 
circuit synthesizer. 

TO 

Claims 

1. An all digital method for tracking and aligning the phase of a binary receiver reference signal to the 
phase of incoming serial binary data stream, wherein said incoming serial binary data stream has a bit 

rs rate which is derived from a distant asynchronous oscillator having a specified nominal frequency and 
tolerance, comprising: 

sampling the transition edge positions of transitions of said incoming binary serial data stream; 

digitally analyzing said transition edge positions to determine the average edge position of the said 
incoming binary serial data stream with reference to a local reference signal by comparing said 
20 transition positions to a phase adjusted local reference signal having the same said specified nominal 
frequency and tolerance as said distant oscillator. 

2. The method of claim 1 Including providing digital command signals to shift the relative phase of said 
binary local reference signal to reduce the phase difference between said Incoming serial binary data 

2S signal and said binary local reference signal. 

3. The method of claim 2 wherein said step of providing digital command signals to shift the relative 
phase of said binary local reference signal includes the step of compensating said local reference 
signal by providing a broadband frequency spectrum comprised of a plurality of time ruler elements 

so having different base frequencies. 

4. The method of claim 3 wherein said plurality of time ruler elements includes a first and second type, 
and wherein said first type includes a plurality of cycles of equal period and said second type Includes 
one cycle of a different period so that the average pulse rate of said local reference signal is equal to 

35 the pulse rate of said incoming serial binary data whereby data recovery can be accurately undertaken 
even in instances of run length of thousands of unchanging value bits in said incoming serial binary 
data. 

The method of claim 2 wherein said step of providing digital command signals to shift the relative 
phase includes the step of delaying said local reference signal in a multistage delay line; and 

generating an end stage pointer (ESP) indicating which stage of said multistage delay line would 
provide exactly a 360 degree phase shift. 

& The method of claim 5 wherein said step of providing digital command signals to shift the relative 
<s phase Includes effectively connect In said esp to said first stage of said delay line to wrap the stage 
selection for phase shift of said reference signal back to the first stage of said local delay line whenever 
the delay needed exceeds 360 degrees in phase. 

7. A method for adjusting the phase of a reference clock to track incoming received serial data including a 
so method for analyzing the edge position of binary transitions In said received serial data with respect to 
said reference clock and determining the average edge position of said received serial data, compris- 
ing: 

a. Initiating the generation of a time ruler pulse train upon arrival of transitions of said received serial 
data by launching a pulse into a serially connected plurality of unit delay elements, each said unit 

65 delay element having an input terminal, an intermediate terminal and an output terminal; 

b. connecting the input, intermediate and output terminate of said unit delays into a edge distribution 
analyzer circuit for sampling and analyzing said pulse train; 
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c. sampling each said input intermediate and output terminal at the occurrence of a selected 
transition of said reference clock; 

d- combinatoriaJJy interconnecting the sampled signals and said reference clock to provide a running 
count of the difference between the number of receiver data transitions which are early (E) versus 
5 late (L) with respect to said selected transition of said reference dock; and 

e. analyzing said early versus late difference count to generate a correction signal for adjusting the 
phase of said reference dock to drive said early minus late (E-L) difference toward zero. 

a. Hie method of claim 7 wherein said combinatorial interconnections also provide a running count of the 
io difference between the number of receiver data transitions which are earlier than a first predetermined 
time and the number of data transition which are later than a second predetermined time, said 
difference being called 1-0; and 

analyzing said difference count to generate a correction signal for adjusting the phase of said 
reference clock to drive said 1-0 difference count toward a positive maximum value. 

75 

9. The method of claim 7 wherein said (l-O) count corrects for 180 degree phase shifts and said E-L count 
corrects for 60 degree phase shifts. 

10. The method of claim 9 wherein the minimum E-L count and the maximum l-O count are a measure of 
20 the phase alignment in said received serial data. 

11. An all digital phase lock loop comprising means for sampGng the transition edge positions of incoming 
serial binary data; 

means for generating a broadband time ruler signal; 
2s means for adjustably shifting the phase of said broadband time ruler signal; 

means for comparing said shifted broadband time ruler signals to said edge positions of said 
incoming serial binary date and analyzing the results of said comparison, said means for adjustably 
shifting said broadband time ruler being responsive to said analyzed comparison. 

30 12. The apparatus of claim 11 wherein said means for comparison provides a statistically averaged 
transition edge position and said means for adjustably shifting causes the said phase of said broadband 
time ruler to track said statistically averaged transition edge position . 

13. The apparatus of claim 1 1 wherein said means for generating a broadband time ruler Includes; 

35 means for causing said broadband time ruler to have a spectrum containing at least a first and 

second base frequency, the first said base frequency being the frequency corresponding to the period 
of said local reference dock multiplied by N, where N is the number of pulses in the compensated 
broad band time ruler; and wherein the said second base frequency corresponds to the period equal to 
the Ti + T2, where Ti = 1/Fr and Ts - 1/Fr-Ft, where Fr equals ttiB frequency of the receiver local 

•ra clock and Ft equals the frequency of the distent dock. 

14. The apparatus of claim 11 wherein said means for adjustably shifting said broadband time ruler 
includes a 380 degree phase shifter. 

45 15. The apparatus of daim 14 wherein said 360 degree phase shifter includes a means to identify exactly 
at which output in a long chain of delay elements outputs that a signal in said delay line is exactly in 
phase with a signal which is propagating in said delay line; 

means to mark said Identified output in said chain of delay element as the end point; and 
means to cause the tap of the first element of said serially connected delay elements to provide 
60 the next output delay when a larger delay than is available at said end point is requested whereby the 
long chain of delay elements is wrapped back at the end point 
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